Efficient Cycle-Accurate Simulation of the Ultrasparc III CPU

نویسندگان

  • Peter E. Strazdins
  • Bill Clarke
  • Andrew Over
چکیده

This paper presents a novel technique for cycleaccurate simulation of the Central Processing Unit (CPU) of a modern superscalar processor, the UltraSPARC III Cu processor. The technique is based on adding a module to an existing fetch-decode-execute style of CPU simulator, rather than the traditional method of fully modelling the CPU microarchitecture. It is also suitable for accurate SMP modelling. The main functions of the module are the simulation of instruction grouping, register interlocks and the store buffer. Its simple table-driven implementation permits easy modification for exploring microarchitectural variations. The technique results in a 40% loss of simulation speed, instead of a 10 times or greater performance loss by fully implementing the detailed micro-architecture. The technique is validated against an actual UltraSPARC III Cu processor, and achieves high levels of accuracy over a range of scientific benchmarks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UltraSPARC-II/: expanding the boundaries of a system on a chip

he central mission of the UltraSPARC-IIi is optimized price/performance and ease of use for the system designer. Stated differently, the CPU must deliver a lot of performance for the least impact on overall system cost and also enable simplified system design. An important strategy in enabling low-cost SPARC-based systems is to leverage the PC industry economy of scale by using the industry-sta...

متن کامل

Evaluation of CPU architecture by simulation technologies and benchmark computer systems

The processor architecture designers face major challenges to improve the processor’s performance. To measure the performance of the processor there are many parameter like performance of cache, TLB, IO operations, bus speed etc. Different companies launch series of processors with same base configuration and a little change of variations base on cache size, cache levels, share and separate cac...

متن کامل

Modelling Time in Full System Simulation

Simulation of computer systems is becoming increasingly important in areas such as code optimization, performance evaluation and processor and memory system design. An important trade-off in simulator design is performance vs. abstraction of the model. Simics is a full system simulator, capable of simulating an entire computer system, including the processor, TLB, MMU, memory hierarchy and surr...

متن کامل

Table 2: Cpu times on an Ultrasparc 2 C Omputer, Number of Oating Point Operations And

This paper introduces a fast and high performance subsampling method using feedforward neural networks. Experimental results using still images are evaluated subjectively and objectively. The results show that our method outperforms, in all cases taken into account, the standard lowpass ltering and subsampling method. Our method's main advantages are high speed, good quality of the nal images w...

متن کامل

Data Access Characteristics and Optimizations for Sun UltraSPARC T2 and T2+ Systems

Processor and system architectures that feature multiple memory controllers and/or ccNUMA characteristics are prone to show bottlenecks and erratic performance numbers on scientific codes. Although cache thrashing, aliasing conflicts, and ccNUMA locality and contention problems are well known for many types of systems, they take on peculiar forms on the new Sun UltraSPARC T2 and T2+ processors,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007